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Abstract. Artin solved Hilbert's 17 th problem by showing that every positive semidef- 
inite polynomial can be realized as a sum of squares of rational functions. Pfister gave 
a bound on the number of squares of rational functions: if p is a positive semi-definite 
polynomial in n variables, then there is a polynomial q so that q 2 p is a sum of at most 
2 n squares. 

As shown by D'Angelo and Lebl, the analog of Pfister's theorem fails in the case of 
Hermitian polynomials. Specifically, it was shown that the rank of any multiple of the 
polynomial ||z|| 2d = (X^ kj| 2 ) d is bounded below by a quantity depending on d. Here we 
prove that a similar result holds in a free *-algebra. 



1. Introduction 

The aim of this section is to define the main objects and to review some related work. 
We work in the real free *-algebra M{X,X*) generated by the n noncommuting (NC) 
variables X\ , . . . , X n and their adjoints X* . After taking a representation we can think of 
these variables as real square matrices, and the * function on M(X,X*) as the transpose 
operation. In particular, * respects addition and multiplication by scalars and is defined on 
monomials by (X^ ■ ■ ■ Xj k )* = X* k ■ ■ ■ X* ± and (X*)* = Xj. We use multi-indices a, tuples 

of non-negative integers from to 2n, to index monomials: X a = X ai X a , 2 ■ ■ ■ X ak . X® is 
simply the empty word, denoted by 1. For < j < n, we define Xj +n = XJ. We define 
conjugation and concatenation of multi-indices a and (3 by the equations X a = (X a )* 
and X a °P = X a X' 3 . 

Evaluation of p E M.(X,X*) at a tuple (Mi, . . . , M n ) of square matrices of the same size is 
defined by the substitution of Mj for Xj and Mj for X* . 

We say that p E R(X, X*) is symmetric when p* = p. Such a polynomial p is said to be 
matrix positive if the matrix p(M) is positive semidefinite (or PSD) for every tuple M 
of square matrices. It was shown by Helton in [3] that every matrix positive polynomial 
is a sum of squares (SOS). The minimal number of squares required to express a matrix 
positive polynomial as a sum of squares is not known in general, although upper bounds 
are easy to obtain. The question is open in the commutative case as well, and in both cases 
amounts to a problem of rank minimization. A great many types of rank minimization 
problems have been successfully attacked in recent years with semidefinite programming 
techniques (see [6] for examples). A complete characterization of conditions for success 
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of the nuclear norm approach, or "trace-heuristic", in this context is not know, though 
Recht provided in [7] a probabilistic characterization of success for particular classes of 
rank minimization problems. 

Optimization in certain quantum physics problems is done over feasible regions of operators 
on Hilbert spaces, and so NC variables are useful there. Several examples and a general 
framework for such problems are presented in [5], where the semidefinite programming 
relaxations of Lasserre are extended to the NC setting. Motivation for the study of NC 
polynomials from control theory is discussed in [2j. 

2. POLYNOMIALS, ASSOCIATED MATRICES AND SUMS OF SQUARES 

To any symmetric polynomial p G M(X,X*) we can associate a real, symmetric matrix M 
with the property 

V*MV = p 

where V* = (X a )\ a \<di an d V is the column vector (X a )|j<,j (with the monomials in 
graded lexicographical order). The matrix M is not unique, in fact the set of all such 
matrices (for a fixed p) forms an affine space which we will denote M p . 

By the rank of p, we mean the minimum of rank(M) over all M 6 M p - For a positive 
polynomial, this minimum is to be taken over only the PSD matrices. The following lemma 
helps us obtain a lower bound on rank 

Lemma 2.1. If A is a symmetric matrix satisfying V* AV = 0, then the (2n) d x (2n) d 
lower right submatrix of A is the zero matrix. 

Proof. Let B denote the block in question, and V the tautological vector of just the mono- 
mials of degree d. Then V* AV = implies that V*BV = as well since the product 
V*BV yields exactly the degree 2d terms of the polynomial V* AV . But the entries of B 
are exactly the coefficients of the distinct monomials in V*BV, hence B is the zero matrix. 

The lemma above shows that there is no freedom in choosing the block corresponding to 
the degree 2d terms of the polynomial. Since the rank of this block gives a lower bound on 
the rank of the whole matrix, taking the block to be the the (2n) d x (2n) d identity yields 
a polynomial with rank at least (2n) d . 

2.1. Positive polynomials and sums of squares. In the commutative case it is well- 
known that the cone of positive polynomials properly contains the SOS cone. Motzkin's 
polynomial M(x, y) = 1 + x 2 y 4 + y 2 x 4 — 3x 2 y 2 is the first known example of a positive 
polynomial outside the SOS cone, and was discovered decades after Hilbert proved the 
existence of such polynomials. 
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In contrast, the NC setting offers the nice result, proved by Helton in [3], that any positive 
polynomial is a sum of squares. Here, a square takes the form /*/, so that obviously 
a sum of squares is positive in the sense defined above. In order to understand the SOS 
representation of a positive polynomial, we use the matrix representation introduced above. 
The following lemma leads us to the semidefinite programming formulation of the rank 
minimization problem. 

Lemma 2.2. A polynomial p is matrix positive exactly when it can be expressed p = 
V*MV , with M a PSD matrix. The rank of p is exactly the minimum number of squares 
over all SOS representations of p. 

The proof is straightforward. It follows that the minimum number of squares for a positive 
p is 



min rank X 
s.t. V*XV = p, 

which can be calculated efficiently (but not always accurately), by minimizing instead the 
trace of X. 

As a simple example of this problem consider the polynomial P = 1 + X*X + XX*, clearly 
a SOS. The polynomial P is a sum of 3 squares, but can be expressed as a sum of 2 squares 
(and no fewer) . To see why we parameterize the affine space A4p by the single parameter 
tel. As usual V = (1,X, X*) T , and so P = V*V = X*X + XX* + 1. Defining 




we get A4p = {I + tM\t G R}, and find the minimal SOS representation 




on the boundary of the region where I + tM y 0. Note that in this example trace is 
constant on Mp Pi PSD, and that the given solution is obtained by maximizing t over 

{t\i + tM y o}. 
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3. The Examples 

Pfister's Theorem gives a bound on the number of rational functions in the SOS represen- 
tation of a PSD polynomial. The bound is remarkable because it does not depend on the 
degree of the polynomial in question. D'Angelo and Lebl proved in pQ that this result fails 
for Hermitian polynomials. We'll show that it fails for noncommutative polynomials. The 
first theorem below is needed for the second. It is easy to check that the polynomial S 
below has rank (2n) d , but more is true. 

Theorem 3.1. Suppose that q E R(X,X*} and define S = Y.\ a \=d Xa * X ° ■ Thenp= q*Sq 
has rank at least (2n) d . Here, (2n) d is the dimension of span{X a }\ a \ =f j 

Proof. Since p is matrix positive, it is a sum of squares, and so we may write p = V*MV, 
appending V with the necessary monomials. Let q be such that q*Sq = p, and write 
q = YlalaX 01 ■ Let & De maximal, with respect to lexicographical ordering, among all a 
such that q a ^ 0. 

We have V*MV =p = q*Sq = q*(E\ a \=d X a * X a )q = £ N=d (X a g)*(X a g). F ° r each a, 
write X a q = Q a V, where Q a is the row vector of the coefficients of X a q. Forming the 
matrix Q whose rows are the Q a we get V*MV = p = V*Q*QV, hence V*{M-Q*Q)V = 0. 

The polynomials X a q form a linearly independent set, and in fact have the distinct leading 
terms ^I aoa . It follows that the last (2n) d+de9 ^ columns of Q form a block of rank at 
least (2n) d . Writing Q in block form Q = [A B] where B is a (2n) d x (2n) d+de9 ^ matrix, 
we compute 



p = V*Q T QV = V* 



A T 
B T 



[A B] V = V* 



A T A 
B T A 



A T B 
B T B 



V. 



The V above includes all monomials up to degree {2n) d+de9 ^ q \ Since V*{M - Q*Q)V = 0, 
we know from the lemma that M cannot differ from Q*Q in its (2n) d+de9 ^ x (2n) d+de9 ^ 
lower right block; this block equals B T B. Therefore M, an arbitrary matrix representation 
for p, has rank at least (2n) d . □ 



Alternatively, one might ask whether a Pfister's Theorem holds for products of the usual 
form. Consider what it would take for q*qS to be a SOS. Because q*q is symmetric, we 
note that since SOS are symmetric we must have q*qS = (q*qS)* = Sq*q, so that q*q 
and S commute. Since we evaluate these polynomials on tuples of matrices, it is tempting 
to treat them as symmetric matrices. In particular, one might guess that if two of them 
commute, then they are both polynomials in a third polynomial. This happens to be true, 
and it follows from the following more general theorem from combinatorics: 
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Theorem 3.2. (Bergman's Centralizer Theorem) Let K be a field, and K(X) the ring 
of polynomials over K in noncommuting variables X±, . . . , X n . Then the centralizer of a 
nonscalar element in K(X) is isomorphic to K[t] for a single variable t. 

The proof is a bit lengthy and can be found in [3]. It uses the fact that such a centralizer 
is integrally closed in its field of fractions together with an easier result in the formal series 
setting: 

Theorem 3.3. (Cohn's Centralizer Theorem) Let K be a field and K((X)) the ring for 
formal power series over K in noncommuting variables X\, . . . ,X n . Then the centralizer 
of a nonscalar element in K((X)) is isomorphic to K[t] for a single variable t. 

These theorems apply despite the superficial difference that we are working with indeter- 
minates X\, . . . , X n ,X*, . . . , X* for which (X*)* = Xi\ there are no polynomial relations 
among them, and so we can regard them as 2n noncommuting variables Yi, . . . , Y<m- Armed 
with Theorem 3.2, we are ready to give the counterexample: 

Theorem 3.4. If p £ M.{X, X*) , a matrix positive polynomial, is of the form q*qS with 
S = T,\ a \=d Xa * X ° > then rank(p) > (2n) d . 

Proof. We will use the previous Theorem 3.1 together with Bergman's Centralizer Theorem. 
The main difficulty lies in showing that under the hypotheses, q*q is actually a polynomial 
in S. 

Invoking the centralizer theorem we write q*q = f(h(X,X*)) and S = g(h(X, X*)) for 
h(X,X*) G R(X,X*) and f(t),g(t) G R[t}. It follows from the equation S = g(h(X,X*)) 
that g must have degree 1. To see why, write 



h(X, X*) = ciX ai + . . . + c\X ai + (lower degree terms), g(t) = a k t k + . . . + o 

with Cj, ai G K. We note that each term X a h ■ ■ ■ X a ^ is symmetric since it must be one 
of the monomials X a * X a in S. Supposing k > 1, we have always that a.j x = a* fc . This 
implies that there is just one cij, which is certainly not the case. Therefore deg(<?) = 1 and 
we write g(t) = at + b so that 5 = g(h(X, X*)) = ah(X, X*) + b or h(X, X*) = l/a(S - b). 

Now we have q*q = f(\/a(S — b)) = r(S) for some polynomial r(t) G R[i]. Since r(S) 
has rank equal to l(it can be expressed as a single noncommutative square), it follows 
that r(t) is of even degree. If not, write r(t) = r2k+\t 2k+1 + • • • + tq with r^k+x 7^ 0. Then 
r(S) = r2k+\S 2k+1 + (lower degree terms) and we have by Theorem 3.1 that S 2k+l = S k SS k 
and therefore r(S) itself has rank at least (2n) d > 1, a contradiction. Finally, tr(t) has odd 
degree and therefore another application of Theorem 3.1 lets us conclude that p = Sr(S) 
has rank at least (2n) d . □ 
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